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Abstract. In the study of deterministic distributed algorithms it is commonly 
assumed that each node has a unique 0(logn)-bit identifier. We prove that for 
a general class of graph problems, local algorithms (constant-time distributed 
algorithms) do not need such identifiers: a port numbering and orientation is 
sufficient. 

Our result holds for so-called simple PO-checkable graph optimisation problems; 
this includes many classical packing and covering problems such as vertex covers, 
edge covers, matchings, independent sets, dominating sets, and edge dominating 
sets. We focus on the case of bounded-degree graphs and show that if a local 
algorithm finds a constant-factor approximation of a simple PO-checkable graph 
problem with the help of unique identifiers, then the same approximation ratio 
can be achieved on anonymous networks. 

As a corollary of our result and by prior work, we derive a tight lower bound 
on the local approximability of the minimum edge dominating set problem. 

Our main technical tool is an algebraic construction of homogeneously ordered 
graphs: We say that a graph is (a, r) -homogeneous if its nodes are linearly ordered 
so that an a fraction of nodes have pairwise isomorphic radius-r neighbourhoods. 
We show that there exists a finite (a, r)-homogeneous 2/c-regular graph of girth 
at least g for any a < 1 and any r, k, and g. 
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Figure 1: Three models of distributed computing. 



1 Introduction 

In this work, we study deterministic distributed algorithms under three different 
assumptions; see Figure 1. 

(ID) Networks with unique identifiers. Each node is given a unique O(logn)- 
bit label. 

(01) Order-invariant algorithms. There is a linear order on nodes. Equiv- 
alently, the nodes have unique labels, but the output of an algorithm 
is not allowed to change if we relabel the nodes while preserving the 
relative order of the labels. 

(PO) Anonymous networks with a port numbering and orientation. For each 
node, there is a linear order on the incident edges, and for each edge, 
there is a linear order on the incident nodes. Equivalently, a node of 
degree d can refer to its neighbours by integers 1, 2, . . . , d, and each 
edge is oriented so that the endpoints know which of them is the head 
and which is the tail. 

While unique identifiers are often useful, we will show that they are seldom 
needed in local algorithms (constant-time distributed algorithms): there is a 
general class of graph problems such that local algorithms in PO are able to 
produce as good approximations as local algorithms in 01 or ID. 

1.1 Graph Problems 

We study graph problems that are related to the structure of an unknown 
communication network. Each node in the network is a computer; each computer 
receives a local input, it can exchange messages with adjacent nodes, and eventually 
it has to produce a local output. The local outputs constitute a solution of a 
graph problem — for example, if we study the dominating set problem, each node 
produces one bit of local output, indicating whether it is part of the dominating set. 
The running time of an algorithm is the number of synchronous communication 
rounds. 

From this perspective, the models ID, 01, and PO are easy to separate. 
Consider, for example, the problem of finding a maximal independent set in an 
n-cycle. In ID model the problem can be solved in 8 (log* n) rounds [6, 17], while 
in 01 model we need 0(n) rounds, and the problem is not soluble at all in PO, 
as we cannot break symmetry — see Figure 2. Hence ID is strictly stronger than 
01, which is strictly stronger than PO. 

1.2 Local Algorithms 

In this work we focus on local algorithms, i.e., distributed algorithms that run 
in a constant number of synchronous communication rounds, independently of 
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Figure 2: In ID, the numerical identifiers break symmetry everywhere — for 
example, a maximal independent set can be found in (9(log* n) rounds. In 01, 
we can have a cycle with only one "seam" , and in PO we can have a completely 
symmetric cycle. 

the number of nodes in the network [19, 22]. The above example separating ID, 
01, and PO no longer applies, and there has been a conspicuous lack of natural 
graph problems that would separate ID, 01, and PO from the perspective of local 
algorithms. 

Indeed, there are results that show that many problems that can be solved with 
a local algorithm in ID also admit a local algorithm in 01 or PO. For example, the 
seminal paper by Naor and Stockmeyer [19] studies so-called LCL problems — these 
include problems such as graph colouring and maximal matchings on bounded- 
degree graphs. The authors show that ID and 01 are indeed equally expressive 
among LCL problems. The followup work by Mayer, Naor, and Stockmeyer [18] 
hints of a stronger property: 

(i) Weak 2-colouring is an LCL problem that can be solved with a local 
algorithm in ID model [19]. It turns out that the same problem can be 
solved in PO model as well [18]. 

Granted, contrived counterexamples do exist: there are LCL problems that are 
soluble in 01 but not in PO. However, most of the classical graph problems that 
are studied in the field of distributed computing are optimisation problems, not 
LCL problems. 

1.3 Local Approximation 

In what follows, we will focus on graph problems in the case of bounded- degree 
graphs; that is, there is a known constant A such that the degree of any node in 
any graph that we may encounter is at most A. Parity often matters; hence we 
also define A' = 2[A/2j. 

In this setting, the best possible approximation ratios are surprisingly similar 
in ID, 01, and PO. The following hold for any given A > 2 and e > 0: 

(ii) Minimum vertex cover can be approximated to within factor 2 in each of 
these models [3, 5]. This is tight: (2 — e)-approximation is not possible in 
any of these models [9, 16, 22]. 

(iii) Minimum edge cover can be approximated to within factor 2 in each of 
these models [22]. This is tight: (2 — e)-approximation is not possible in 
any of these models [9, 16, 22]. 

(iv) Minimum dominating set can be approximated to within factor A' + 1 in 
each of these models [4]. This is tight: (A' + 1 — e)-approximation is not 
possible in any of these models [9, 16, 22]. 
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Figure 3: Graph % is a lift of Q. The covering map ip: V(H) — > V{Q) maps 
di <— ¥ a, hi i— > b, Ci i — y c, and di i— >• d for each i = 1, 2. The fibre of a G ^(^) is 
{oi, 02} C ^(H); all fibres have the same size. 

(v) Maximum independent set cannot be approximated to within any constant 
factor in any of these models [9, 16]. 

(vi) Maximum matching cannot be approximated to within any constant factor 
in any of these models [9, 16]. 

This phenomenon has not been fully understood: while there are many problems 
with identical approximability results for ID, 01, and PO, it has not been known 
whether these are examples of a more general principle or merely isolated co- 
incidences. In fact, for some problems, tight approximability results have been 
lacking for ID and 01, even though tight results are known for PO: 

(vii) Minimum edge dominating set can be approximated to within factor 4— 2/A' 
in each of these models [21]. This is tight for PO but only near-tight for 
ID and 01: (4 — 2/A' — e)-approximation is not possible in PO [21], and 
(3 — e)-approximation is not possible in ID and 01 [9, 16, 22]. 

In this work we prove a theorem unifying all of the above observations — they 
are indeed examples of a general principle. As a simple application of our result, 
we settle the local approximability of the minimum edge dominating set problem 
by proving a tight lower bound in ID and 01. 

1.4 Main Result 

A simple graph problem l"l is an optimisation problem in which a feasible solution 
is a subset of nodes or a subset of edges, and the goal is to either minimise or 
maximise the size of a feasible solution. We say that n is a PO-checkable graph 
problem if there is a local PO-algorithm A that recognises a feasible solution. That 
is, A(G, X,v) = 1 for all nodes v G V(Q) if X is a feasible solution of problem n in 
graph Q, and /\(Q,X,v) = for some node v G V{Q) otherwise — here /\(Q,X, v) 
is the output of a node v if we run algorithm A on graph Q and the local inputs 
form an encoding of X. 

Let ip: V{T~L) — > V{Q) be a surjective graph homomorphism from graph T~L 
to graph Q. If (p preserves vertex degrees, i.e., deg-^(u) = degg(ip(u)), then 99 is 
called a covering map, and % is said to be a lift of Q. The fibre of u G V{Q) is 
the set ip~ 1 (u) of pre-images of u. We usually consider n-lifts that have fibres of 
the same cardinality n. It is a basic fact that a connected lift % of Q is an n-lift 
for some n. See Figure 3 for an illustration. 

Let J 7 be a family of graphs. We say that J- is closed under lifts if Q G T 
implies % G J- for all lifts H of Q. A family is closed under connected lifts if 
Q G T implies fi G T whenever H and Q are connected graphs and ri is a lift 

of g. 
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Now we are ready to state our main theorem. 

Theorem 1 (Main Theorem). Let n be a simple PO-checkable graph problem. 
Assume one of the following: 

- General version: J 7 is a family of bounded degree graphs, and it is closed 
under lifts. 

- Connected version: J 7 is a family of connected bounded degree graphs, it 
does not contain any trees, and it is closed under connected lifts. 

If there is a local ID -algorithm A that finds an a -approximation of l~l in T, then 
there is a local PO-algorithm B that finds an a -approximation of l~l in T. 

While the definitions are somewhat technical, it is easy to verify that the 
result is widely applicable: 

(a) Vertex covers, edge covers, matchings, independent sets, dominating sets, 
and edge dominating sets are simple PO-checkable graph problems. 

(b) Bounded-degree graphs, regular graphs, and cyclic graphs are closed under 
lifts. 

(c) Connected bounded-degree graphs, connected regular graphs, and connected 
cyclic graphs are closed under connected lifts. 

1.5 An Application 

The above result provides us with a powerful tool for proving lower-bound results: 
we can easily transfer negative results from PO to 01 and ID. We demonstrate this 
strength by deriving a new lower bound result for the minimum edge dominating 
set problem. 

Theorem 2. Let A > 2, and let IK be a local ^-algorithm that finds an a-approx- 
imation of a minimum edge dominating set on connected graphs of maximum 
degree A. Then a > ao, where 

a = 4-2/A' and A' = 2[A/2j. 

This is tight: there is a local \ D-algorithm that finds an a^- approximation. 

Proof. By prior work [21], it is known that there is a connected A'-regular graph 
Qo such that the approximation factor of any local PO-algorithm on Qo is at least 
ao. Let Tq consist of all connected lifts of Qo, and let T consist of all connected 
graphs of degree at most A. We make the following observations. 

(a) We have To C T; by assumption, A finds an a-approximation in To- 

(b) Family To consists of connected graphs of degree at most A, it does not 
contain any trees, and it is closed under connected lifts. We can apply the 
connected version of the main theorem: there is a local PO-algorithm B 
that finds an a-approximation in To- 

(c) However, Qo G To, and hence a > ao- 

The matching upper bound is presented in prior work [21]. □ 
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1.6 Overview 



Informally, our proof of the main theorem is structured as follows. 

(a) Fix a graph problem n, a graph family J 7 , and an ID-algorithm A as in the 
statement of Theorem 1. Let r be the running time of ID-algorithm A. 

(b) Let Q 6 T be a graph with a port numbering and orientation. 

(c) Section 3.2: We construct a certain lift Q e £ J- of Q. Graph Q t inherits the 
port numbering and the orientation from Q. 

(d) Section 4.1: We show that there exists a linear order < t on the nodes of 
Q e that gives virtually no new information in comparison with the port 
numbering and orientation. If we have an 01-algorithm A' with running 
time r, then we can simulate A' with a PO-algorithm B' almost perfectly 
on Q t : the outputs of A' and B' agree for a (1 — e) fraction of nodes. We 
deduce that the approximation ratio of A' on T cannot be better than the 
approximation ratio of B' on T . 

(e) Section 4.2: We apply Ramsey's theorem to show that the unique identifiers 
do not help, either. We can construct a PO-algorithm B that simulates A in 
the following sense: there exists an assignment of unique identifiers on a lift 
% € J- of Q t such that the outputs of A and B agree for a (1 — e) fraction 
of nodes. We deduce that the approximation ratio of A on J 7 cannot be 
better than the approximation ratio of B on T . 

Now if graph Q was a directed cycle, the construction would be standard; see, 
e.g., Czygrinow et al. [9]. In particular, Q t and % would simply be long cycles, 
and < e would order the nodes along the cycle — there would be only one "seam" 
in (Q e , < e ) that could potentially help A' in comparison with B', and only an e 
fraction of nodes are near the seam. 

However, the case of a general Q is more challenging. Our main technical tool is 
the construction of so-called homogeneous graphs; see Section 3.1. Homogeneous 
graphs are regular graphs with a linear order that is useless from the perspective 
of 01-algorithms: for a (1 — e) fraction of nodes, the local neighbourhoods are 
isomorphic. Homogeneous graphs trivially exist; however, our proof calls for 
homogeneous graph of an arbitrarily high degree and an arbitrarily large girth 
(i.e., there are no short cycles — the graph is locally tree-like). In Section 5 we 
use an algebraic construction to prove that such graphs exist. 

1.7 Discussion 

In the field of distributed algorithms, the running time of an algorithm is typically 
analysed in terms of two parameters: n, the number of nodes in the graph, and 
A, the maximum degree of the graph. In our work, we assumed that A is a 
constant — put otherwise, our work applies to algorithms that have a running 
time independent of n but arbitrarily high as a function of A. The work by Kuhn 
et al. [12-14] studies the dependence on A more closely: their lower bounds on 
approximation ratios apply to algorithms that have, for example, a running time 
sublogarithmic in A. 

While our result is very widely applicable, certain extensions have been left 
for future work. One example is the case of planar graphs [9], [15, §13]. The 
family of planar graphs is not closed under lifts, and hence Theorem 1 does not 
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apply. Another direction that we do not discuss at all is the case of randomised 
algorithms. 

2 Three Models of Distributed Computing 

In this section we make precise the notion of a local algorithm in each of the 
models ID, 01 and PO. First, we discuss the properties common to all the models. 

We start by fixing a graph family T where every Q = (V(Q), E(Q)) G T has 
maximum degree at most A G N. We consider algorithms A that operate on 
graphs in J 7 ; the properties of A (e.g., its running time) are allowed to depend 
on the family T (and, hence, on A). We denote by A(Q,u) G ft the output of 
A on a node u G V(Q). Here, Q is a finite set of possible outputs of A in T. 
If the solutions to n are sets of vertices, we shall have Q = {0, 1} so that the 
solution produced by A on Q, denoted A(Q), is the set of nodes u with A(Q, u) = 1. 
Similarly, if the solutions to n are sets of edges, we shall have f2 = {0, 1} A so that 
the ith component of the vector A(Q,u) indicates whether the ith edge incident 
to u is included in the solution A{Q) — in each of the models a node will have a 
natural ordering of its incident edges. 

Let r G N denote the constant running time of A in J 7 . This means that a 
node u can only receive messages from nodes within distance r in Q, i.e., from 
nodes in the radius-r neighbourhood 

Bg(u,r) = {v G V(G) : distg(u,v) < r). 

Let t(Q,u) denote the structure (Q,u) restricted to the vertices Bg(u,r), i.e., in 
symbols, 

t{G,u) = (G,u) \ Bg(u,r). 

Then A(Q,u) is a function of the data t(Q, u) in that A(G,u) = A(t(G,u)). The 
models ID, 01 and PO impose further restrictions on this function. 

2.1 Model ID 

Local ID-algorithms are not restricted in any additional way. We follow the 
convention that the vertices have unique 0(logn)-bit labels, i.e., an instance 
Q G T of order n = \V(Q)\ has V{Q) C {1, 2, ... , s(n)} where s(n) is some fixed 
polynomial function of n. Our presentation assumes s(n) = o;(n), even though 
this assumption can often be relaxed as we discuss in Remark 4.1. 

2.2 Model 01 

A local 01-algorithm A does not directly use unique vertex identifiers but only 
their relative order. To make this notion explicit, let the vertices of Q G J- be 
linearly ordered by <, and call (Q, <) an ordered graph. Denote by t{Q, <,u) 
the restriction of the structure (Q, <,u) to the r-neighbourhood Bg(u,r), i.e., in 
symbols, 

t(G, <, u) = (G, <, u) \ Bg(u, r). 

Then, the output A(Q, <,u) depends only on the isomorphism type of t(Q, <,u), 
so that if t(Q,<,u) ~ t{G',<',u') then A(Q,<,u) = A(Q' , <',u'). 
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Figure 4: (a) A graph Q with a port numbering and an orientation, (b) A 
proper labelling £g that is derived from the port numbering. We have an 
L-digraph with L = {a, b, c}, a — (1, 2), 6 = (2. 1), and c = (3, 1). (c) The view 
of Q from u is an infinite directed tree T — T(G , u); there is a covering map 
tp from T to Q that preserves adjacencies, orientations, and edge labels. For 
example, ip(\) — ip^aab^ 1 ) = u. 



2.3 Model PO 

In the PO model the nodes are considered anonymous and only the following 
node specific structure is available: a node can communicate with its neighbours 
through ports numbered 1,2,..., deg(w), and each communication link has an 
orientation. 

Edge-Labelled Digraphs. To model the above, we consider L- edge-labelled 
directed graphs (or L-digraphs, for short) Q = (V(Q), E(Q),£g), where the edges 
E{Q) C V{Q) x V{Q) are directed and each edge e £ E(Q) carries a label 
£g(e) G L. We restrict our considerations to proper labellings ig : E(Q) — > L that 
for each u G V{Q) assign the incoming edges (v,u) G E(Q) distinct labels and 
the outgoing edges (u,w) G E{Q) distinct labels; we allow £g(v,u) = £g(u,w). 
We refer to the outgoing edges of a node by the labels L and to the incoming 
edges by the formal letters L^ 1 = {i^ 1 : £ G L}. In the context of L-digraphs, 
covering maps (p: V{Tl) — > V{Q) are required to preserve edge labels so that 
l H (u,v) = lg(y(u),v(v)) for all (u,v) G E{%). 

A port numbering on Q gives rise to a proper labelling £g(v,u) = where 
u is the ith neighbour of v, and v is the jth neighbour of u; see Figure 4. We 
now fix L to contain every possible edge label that appears when a graph Q G T 
is assigned a port numbering and an orientation. Note that \L\ < A 2 . 
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Figure 5: The complete i-labelled rooted directed tree (T* , A) of radius r = 2, 
for L = {a, 6}. 



Views. The information available to a PO-algorithm computing on a node 
u G V(£7) in an L-digraph is usually modelled as follows [1, 22, 23]. The 
view of from u is an L-edge-labelled rooted (possibly infinite) directed tree 
T = T(G,u), where the vertices V(T) correspond to all non-backtracking walks 
on Q starting at u; see Figure 4c. Formally, a fe-step walk can be identified with 
a word of length k in the letters L U A non-backtracking walk is a reduced 
word where neither nor appear. If w G ^(7") is a walk on Q from 
u to w, we define <p{w) = v. In particular, the root of T is the empty word A 
with y?(A) = u. The directed edges of T (and their labels) are defined in such a 
way that <p: V(T) — > V{G) becomes a covering map. Namely, w G V(7") has an 
out-neighbour wi for every ^ G L such that ip(w) has a outgoing edge labelled I. 

Local PO- Algorithms. The inability of a PO-algorithm B to detect cycles 
in a graph is characterised by the fact that B(G,u) = B(T(G ,u)). In fact, we 
define a local PO-algorithm as a function B satisfying B(Q,u) = B(t(T(G ,u))). 
An important consequence of this definition is that the output of a PO-algorithm 
is invariant under lifts, i.e., if <f>: V{%) —> V(G) is a covering map of L-digraphs, 
then B(H,u) = B(G , (p{u)). The intuition is that nodes in a common fibre are 
always in the same state during computation as they see the same view. 

The following formalism will become useful. Denote by (T* , A) the complete 
L-labelled rooted directed tree of radius r with V(T*) consisting of reduced 
words in the letters L U L^ 1 , i.e., every non-leaf vertex in T* has an outgoing 
edge and an incoming edge for each £ G L; see Figure 5. The output of B on 
every graph G G T is completely determined after specifying its output on the 
subtrees of (T*, A). More precisely, let 2U consist of vertex sets W C V(T*) such 
that (T*, A) \ W = t(T(G, u)) for some G G T and u G V(G)- Then a function 
B: 2U -»■ n defines a PO-algorithm by identifying B((T*, A) \ W) = B(W). 
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3 Order Homogeneity 



In this section we introduce some key concepts that are used in controlling the 
local symmetry breaking information that is available to a local 01-algorithm. 

3.1 Homogeneous Graphs 

In the following, we take the isomorphism type of an r-neighbourhood r = 
t(Q, <,u) to be some canonical representative of the isomorphism class of r. 

Definition 1. Let (H, <) be an ordered graph. If there is a set U C V{%) of 
size \U\ > a\H\ such that the vertices in U have a common r-neighbourhood 
isomorphism type t* , then we call (H, <) an (a, r) -homogeneous graph and r* 
the associated homogeneity type of H. 

Homogeneous graphs are useful in fooling 01-algorithms: an (a, ^-homoge- 
neous graph forces any local 01-algorithm to produce the same output in at 
least an a fraction of the nodes in the input graph. However, there are some 
limitations to how large a can be: Let (Q, <) be a connected ordered graph on at 
least two vertices. If u and v are the smallest and the largest vertices of Q, their 
r-neighbourhoods t(Q, <,u) and t(Q, <,v) cannot be isomorphic even for r = 1. 
Thus, non-trivial finite graphs are not (1, l)-homogeneous. Moreover, an ordered 
(2k — l)-regular graph cannot be (a, l)-homogeneous for any a > 1/2; this is the 
essence of the weak 2-colouring algorithm of Naor and Stockmeyer [19]. 

Our main technical tool will be a construction of graphs that satisfy the 
following properties: 

(1) (1 — e, r)-homogeneous for any e > and r, 

(2) 2/c-regular for any k, 

(3) large girth, 

(4) finite order. 

Note that it is relatively easy to satisfy any three of these properties: 

(1), (2), (3) Infinite 2/c-regular trees admit a (1, r)-homogeneous linear order; 
see Figure 6 for an example. 

(1), (2), (4) We can construct a sufficiently large /c-dimensional toroidal grid 
graph (cartesian product of k directed cycles) and order the 
nodes lexicographically coordinate-wise; see Figure 7 for an 
example. However, these graphs have girth 4 when k > 2. 

(1) , (3), (4) A sufficiently large directed cycle is (1 — e, r)-homogeneous and 

has large girth. However, all the nodes have degree 2. 

(2) , (3), (4) It is well known that regular graphs of arbitrarily high girth 

exist. 

Our construction satisfies all four properties simultaneously. 

Theorem 3. Let k,r G N. For every e > there exists a finite Ik-regular 
(1 — e,r) -homogeneous connected graph (H e ,< t ) of girth larger than 2r + 1. 
Furthermore, the following properties hold: 

(a) The homogeneity type r* of (T-L e , < e ) does not depend on e. 

(b) The graph % t and the type r* are k- edge-labelled digraphs. 
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Figure 6: A fragment of a 4-regular infinite ordered tree (Q, <). The numbering 
of the nodes indicates a (1, r)-homogeneous linear order in the neighbourhood 
of node 27; grey nodes are larger than 27 and white nodes are smaller than 27. 




Figure 7: A 4-regular graph Q constructed as the cartesian product of two 
directed 6-cycles. We define the ordered graph (Q, <) by choosing the linear 
order If < 12 < • • • < 16 < 21 < 22 < • • • < 66. The radius-1 neighbourhood 
of node 25 is isomorphic to the radius-1 neighbourhood of node 42. In general, 
there are 16 nodes (fraction 4/9 of all nodes) that have isomorphic radius-1 
neighbourhoods; hence (Q, <) is (4/9, l)-homogeneous. It is also (1/9,2)- 
homogeneous. 
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We defer the proof of Theorem 3 to Section 5. There, it turns out that 
Cayley graphs of soluble groups suit our needs: The homogeneous toroidal graphs 
mentioned above are Cayley graphs of the abelian groups Z^. Analogously, we 
use the decomposition of a soluble group into abelian factors to guarantee the 
presence of a suitable ordering. However, to ensure large girth, the groups we 
consider must be sufficiently far from being abelian, i.e., they must have large 
derived length [7] . 

3.2 Homogeneous Lifts 

We fix some notation towards a proof of Theorem 1. By Theorem 3 we let 
(H e ,< e ), e > 0, be a family of 2|L|-regular (1 — e, r)-homogeneous connected 
graphs of girth > 2r + 1 interpreted as L-digraphs. The homogeneity type r* 
that is shared by all Ti e is then of the form r* = (T*, <*, A), where T* is the 
complete L- labelled tree of Section 2.3. 

We use the graphs % e to prove the following theorem. 

Theorem 4. Let Q be an L-digraph. For every e > there exists a lift (Q e , <g e ) 
of Q such that a (1 — e) fraction of the vertices in (Q e , <g t ) have r -neighbourhoods 
isomorphic to a subtree of r* = (T*,<*,A). Moreover, if Q is connected, Q e can 
be made connected. 

Proof. Write (C,<c) = (Ge,<ge) and {H,<h) = (He,<e) f° r short. Our goal 
is to construct (C, <c) as a certain product of (H, <y) and Q; see Figure 8. 
This product is a modification of the common lift construction of Angluin and 
Gardiner [2]. 

The lift C is defined on the product set V(C) = V{%) x V{Q) by "matching equi- 
labelled edges": the out-neighbours of (h,g) G V(C) are vertices (h',g') G V(C) 
such that (h,h r ) G E(H), (g,g') G E(Q) and £ n (h,h') = £g(g,g'). An edge 
((h, g), (h', g')) G E(Q) inherits the common label £n(h, h') = £g(g, g'). 

The properties of C are related to the properties of Q and H as follows. 

(a) The projection ipg : V{C) — > V{Q) mapping (h, g) t- > g is a covering map. 
This follows from the fact that each edge incident to g G V{Q) is always 
matched against an edge of H in the fibre V(H) x {g}. 

(b) The projection ipu'- V(C) — > V(H) mapping (h,g) \— > h is not a covering 
map in case Q is not 2|L|-regular. In any case ipu is a graph homomorphism, 
and this implies that C has girth > 2r + 1 . 

Next, we define a partial order < p on V(C) as u < p v <^=^ <fn{ u ) <H 
for u, v G V(C). Note that this definition leaves only pairs of vertices in a common 
(,2%-fibre incomparable. But since % has large girth, none of the incomparable 
pairs appear in an r- neighbourhood of C. We let <c be any completion of < p into 
a linear order. The previous discussion implies that <c satisfies r(C, <c,u) = 
t(C,< p ,u) for all u G V(C). 

Let U H C V(H), \U H \ > (1 - e)\H\, be the set of type r* vertices in (H, <%)■ 
Set Uq = <p^(Ufi) so that \Uq\ > (1 — Let u G Uq- By our definition of 

< p , ipu maps the r-neighbourhood t u = r(C, <c,u) into t(H, <-Ht l Ph( u )) — T * 
while preserving the order. But because r* is a tree, tpu must be injective on the 
vertex set of t u so that t u is isomorphic to a subtree of r* as required. 

Finally, suppose Q is connected. Then, by averaging, some connected compo- 
nent of C will have vertices in Uq with density at least (1 — e). This component 
satisfies the theorem. □ 
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Figure 8: Homogeneous lifts. In this example L = |2|, and the two labels 
are indicated with two different kinds of arrows. Graph T-L t is a homogeneous 
2|L|-regular ordered L-digraph with a large girth — in particular, the local 
neighbourhood of a node looks like a tree. Graph Q is an arbitrary L-digraph, 
not necessarily ordered. Their product Q e is a lift of G, but it inherits the 
desirable properties of Ji e : a high girth and a homogeneous linear order. 
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4 Proof of Main Theorem 



Next, we use the tools of the previous section to prove Theorem 1. For clarity of 
exposition we first prove Theorem 1 in the special case where A is an 01-algorithm. 
The subsequent proof for an I D-algorithm A uses a somewhat technical but well- 
known Ramsey type argument. 

4.1 Proof of Main Theorem for 01-algorithms 

We will prove the general and connected versions of Theorem 1 simultaneously; 
for the proof of the connected version it suffices to consider only connected lifts 
below. We do not need the assumption that T does not contain any trees. 

Let n be as in the statement of Theorem 1. Suppose an 01-algorithm A finds 
an a-approximation of l~l in T . We define a PO-algorithm B simply by setting 
for W G 2XT, 

B(W) = A((T*,<*,A) \W). 

Now, Theorem 4 translates into saying that for every Q G T and e > we have 
that A(G t , <ge, u ) = B(Q e ,u) for at least a (1 — e) fraction of nodes u G V{Q e ). 
The claim that B works as expected follows essentially from this fact as we argue 
next. 

For simplicity, we assume the solutions to l~l are sets of vertices so that 
A(Q) C V(Q); solutions that are sets of edges are handled similarly. 

Fix Q G T and let ip t : V{Q e ) — > V(Q), e > 0, be the associated covering maps. 



Algorithm B Finds a Feasible Solution of n on Q. Let V be a local PO- 
algorithm verifying the feasibility of a solution for n ; we may assume V also runs 
in time r. For e > sufficiently small, each v G V{Q) has a pre-image v' G Lp~ l {v) 
such that A and B agree on the vertices {j ve v(Q) Bg e (v' , r). Thus, V accepts the 
solution B(Q e ) on the vertices v'. But because ip e ({v' : v G V{Q)}) = V(Q) it 
follows that V accepts the solution B(Q) = ip e (B(Q e )) on every node in Q. 



Algorithm B Finds an a- Approximation of n on Q. We assume l~l is 
a minimisation problem; maximisation problems are handled similarly. Let 
X C V(Q) and X e C V(Q € ) be some optimal solutions of n. 

As e — > 0, the solutions B(Q e ) and A(Q e ) agree on almost all the vertices. 
Indeed, a simple calculation shows that |B(C/ e )| < /(e) • |A(C7 e )| for some / with 
/(e) — t- 1 as e —7- 0. Furthermore, 

\B(G)\ \^(B(G))\ ^ |B^)| ^ /(6)-|A(£)| ^ /(e)Qj 



\X\ \ V 7 l (X)\ ~ \X e \ ~ \X e \ 

where the first equality follows from (p t being an n-lift, and the first inequality 
follows from ^7 1 (B(^)) = B(Q e ) and the fact that ipJ 1 (X) is a feasible solution 
so that \X e \ < \ip~ 1 (X)\. Since the above inequality holds for every e > we 
must have that |B(^)|/|X| < a, as desired. 



4.2 Proof of Main Theorem for ID-algorithms 

We extend the above proof to the case of local ID-algorithms A by designing 
"worst-case" vertex identifiers for the instances in T in order to make A behave 
similarly to a PO-algorithm on tree neighbourhoods. To do this we use the 
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Ramsey technique of Naor and Stockmeyer [19]; see also Czygrinow et al. [9]. For 
a reference on Ramsey's theorem see Graham et al. [11]. 

We use the following notation: if (X, <x) and (Y, <y) are linearly ordered sets 
with \X\ < \Y\, we write /: (X, <x) ^ (Y, <y) for the unique order-preserving 
injection /: X — > Y that maps the ith element of X to the iih element of Y. A 
t-set is a set of size t, and the set of t-subsets of X is denoted X^\ 

Write fl w for the family of functions 2H — > Q; recall that each B G fl w can 
be interpreted as a PO-algorithm. Set k = and t = |T*|. We consider every 
t-subset A G to be ordered by the usual order < on N. For W G 2B we let 
fw,A ■ (W, <*) (A, <) so that the vertex-relabelled tree fw,A((T*, A) [ W) has 
the | IF | smallest numbers in A as vertices. Define a fc-colouring c: — > £l w 
by setting 

c(A)(W) = A(f W:A ((T*,\)\W)). 

For each to > i we can use Ramsey's theorem to obtain a number > m, 

so that for every R(m)-set I C N there exists an m-subset J C / such that 
is monochromatic under c, i.e., all t-subsets of J have the same colour. In 
particular, for every interval 

I(m, i) = [(i - l)R{m) + 1, iR(m)}, i > 1, 

there exist an m-subset J(m,i) C I(m,i) and a colour (i.e., an algorithm) 
^>m,i G f^ 211 such that c(A) = B mj j for all t-subsets A C J(m, i). 
This construction has the following property. 

Proposition 5. Suppose m > \Q e \ + t. Algorithms A and B mj j produce the 
same output on at least a (1 — e) fraction of the vertices in the vertex-relabelled 
L-digraph f mt i(Q t ), where 

f m ,i - (V(g e ),<g e ) (J(m, »),<)■ 

Proo/. By Theorem 4, let C/ C V(/ m ,i(^ e )). 1^1 > (1 — e)|^ e |, be the set of vertices 
u with T(f mj i(g e ), <, u) isomorphic to a subtree of r*. In particular, for a fixed 
w G U we can choose IF G 2B such that 

r(/ m ,,(g e ),<^)~(T,<*,A) \W. 

Now, as m is large, there exists a t-set A C J(m, i) such that 

T(fm,i(Ge),u) = fw,A((T*,X) \ W). 

Thus, A and B m> j agree on u by the definition of B m ^. □ 

For every n G N some colour appears with density at least 1/k (i.e., appears at 
least n/k times) in the sequence B m> i, B mj 2, • ■ , B mj „. Hence, let B m be a colour 
that appears with density at least 1/k among these sequences for infinitely many 
n. Let B be a colour appearing among the B m for infinitely many m. We claim 
B satisfies Theorem 1. In fact, Theorem 1 follows from the following proposition 
together with the considerations of Section 4.1. 

Proposition 6. For every Q e there exists an n-lift % of Q e such that V(%) C 
{1,2,..., s(|%|)} and A(%, u) = B(H, u) for a (1 — e) fraction of nodes u G V(H). 
Moreover, if Q e is connected and not a tree, % can be made connected. 
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Proof. Let m be such that m > \Q e \ + 1 and B = B m . For infinitely many n there 
exists an n-set / C [nk] of indices such that B = B mi j for i £ I. Consider the 
following n-lift of Q t obtained by taking disjoint unions: 

w = U/«m(&)- 

iei 

Algorithms A and B agree on a (1 — e) fraction of the nodes in % by Proposition 5. 
Furthermore, we have \H\ = n\Q e \ and V(H) C {1,2, ... ,nkR(m)}. We are 
assuming that s(n) = w(n) so choosing a large enough n proves the non-connected 
version of the claim. 

Finally, suppose Q e is connected and not a tree. We may assume that there is 
an edge e = (u, v) G E{Q e ) so that <5 e remains connected when e is removed and 
that a (1 — e) fraction of vertices in Q e have r- neighbourhoods not containing e 
that are isomorphic into r*. Now % above is easily modified into a connected 
graph by redefining the directed matching between the fibre {uj}j G / of u and the 
fibre {vi}i£i of v. Namely, let n be a cyclic permutation on I and set 

E' = (E{H) \ {{ui,Vi)} ieI ) U {( Ui ,v Hi) )} ieI . 

Then %' = (V(H),E') is easily seen to be a connected n-lift of Q e satisfying the 
claim. □ 

Remark 4.1. Above, we assumed that instances Q have node identifiers V{Q) C 
{1,2,..., s(n)}, n = |^|, for s(n) = w(n). By choosing identifiers more economi- 
cally as in the work of Czygrinow et al. [9] one can show lower bounds for the 
graph problems of Section 1.3 even when s(n) = n. 

5 Construction of Homogeneous Graphs of Large 
Girth 

In this section we prove Theorem 3. Our construction uses Cayley graphs of 
semi-direct products of groups. First, we recall the terminology in use here; for a 
standard reference on group theory see, e.g., Rotman [20]. 

For the benefit of the reader who is not well- versed in group theory we include 
in Appendix A a short primer on the semi-direct product groups that are used 
below. 

5.1 Semi-Direct Products 

Let G and H be groups with H acting on G as a group of automorphisms. We 
write h ■ g for the action of h G H on g G G so that the mapping g h-> h ■ g is 
an automorphism of G. The semi-direct product G x H is defined to be the set 
G x H with the group operation given by 

(g,h){g',ti) = (g(h.g'),hti). 

5.2 Cayley Graphs 

The Cayley graph C(G, S) of a group G with respect to a finite set S C G is an 
S*-digraph on the vertex set G such that each g G G has an outgoing edge (g, gs) 
labelled s for each s G S. We require that 1 G" S so as not to have any self-loops. 
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We do not require that S is a generating set for G, i.e., the graph C(G, S) need 
not be connected. 

If (p : H — > G is an onto group homomorphism and S C H is a set such that 
the mapping ip is injective on S U {1}, then (p naturally induces a covering map 
of digraphs C(H, S) and C(G, <p(S)). 

5.3 Proof of Theorem 3 

Let n £ N be an even number. We consider three families of groups, {-Hi}i>i, 
{Wi}i>i, and {C/j}i>i, that are variations on a common theme. The families are 
defined iteratively as follows: 

H x = Z„, = Z 2 , l/i = Z, 

= i/? x Z n , = W? x Z 2 , I7 i+1 = Uf x z. 

Here, the cyclic group Z ra = {0, 1, . . . , n — 1} acts on the direct product Hf = 
Hi x Hi by cyclically permuting the coordinates, i.e., the subgroup 2Z n < Z ra 
acts trivially and the elements in 1 + 2Z„ swap the two coordinates. The groups 
Z 2 and Z act analogously in the definitions of Wi and C/j. See Appendix A for 
more information on groups Hi, Wi, and Ui. 

The underlying sets of the groups Hi, Wi, and Ui consist of (i(i)-tuples of 
elements in Z, for d(i) = 1 % — 1, so that Wi C Hi C Ui as sets. Interpreting these 
tuples as points in M d W we immediately get a natural embedding of every Cayley 
graph of these groups in R d W. This geometric intuition will become useful later. 

(a) The groups Wi are i-fold iterated regular wreath products of the cyclic 
group Z 2 . These groups have order \Wi\ = 2 d W and they are sometimes 
called symmetric 2-groups; they are isomorphic to the Sylow 2-subgroups 
of the symmetric group on 2 Z letters [20, p. 176]. 

(b) The groups Ui are natural extensions of the groups Wj by the free abelian 
group of rank d{i): the mapping Ui — > Wi that reduces each coordinate 
modulo 2 is easily seen to be an onto homomorphism with abelian kernel 

(2Z) d W ~ Z d W. 

(c) The groups Hi are intermediate between Ui and Wi in that the mapping 
ipi'. Ui —> Hi that reduces each coordinate modulo n is an onto homomor- 
phism, and the mapping p\ : Hi — >■ Wi that reduces each coordinate modulo 
2 is an onto homomorphism. In summary, the following diagram commutes: 




Our goal will be to construct a suitable Cayley graph % of some Hi. We 
will use the groups Wi to ensure H has large girth, whereas the groups Ui will 
guarantee that H has an almost-everywhere homogeneous linear ordering. 



Girth. Gamburd et al. [10] study the girth of random Cayley graphs and prove, 
in particular, that a random /c-subset of Wi generates a Cayley graph of large 
girth with high probability when i 3> k is large. We only need the following 
weaker version of their theorem (see Appendix B for an alternative, constructive 
proof). 
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Theorem 7 (Corollary to [10, Theorem 6]). Let k,r G N. There exists an i G N 
and a set S C Wj, | *S f | = k, such that the girth of the Cayley graph C(Wi, S) is 
larger than 2r + 1 . 

Fix a large enough j 6 N and a k-set S C Hj so that C(Wj, S) has a girth 
larger than 2r + 1. Henceforth, we omit the subscript j and write H, W, U, (p, tp 
and d in place of Hj, Wj, Uj, tpj, ipj and d(j). Interpreting S as a set of elements 
of H and U (so that = i/)(S) = S) we construct the Cayley graphs 

U = C{H,S), W = C(W,S), and U = C(U,S). 

As each of these graphs is a lift of W, none have cycles of length at most 2r + 1 
and their r-neighbourhoods are trees. 

Linear Order. Next, we introduce a left-invariant linear order < on U satis- 
fying 

u < v =>• wii < wv, for all u,v,w G £7. 

Such a relation can be defined by specifying a positive cone P C {7 of elements 
that are greater than the identity 1 = 1^ so that 

u < v <J=^ 1 < it" 1 !; <^=^ u~ 1 v G P. 

A relation < defined this way is automatically left-invariant; it is transitive iff 
u,v G P implies uv G P; and every pair u 7^ v is comparable iff for all it; ^ 1, 
either it; G P or it;" 1 G P. The existence of a P satisfying these conditions 
follows from the fact that U is a torsion-free soluble group (e.g., [8]), but it is 
easy enough to verify that setting 

P = {(1x1,1x2, ■ ■ ■ ,iH,0,0, . . . ,0) G U : 1 < i < d and iXj > 0} (1) 

satisfies the required conditions above (see Appendix A. 3). 

Because U acts (by multiplication on the left) on IA as a vertex-transitive 
group of graph automorphisms, it follows that the structures (U, <,u), u G U , 
are pairwise isomorphic. A fortiori, the r-neighbourhoods t(U, <,u), u G U, are 
all pairwise isomorphic. Let r* be this common r-neighbourhood isomorphism 
type. 

Transferring the Linear Order on U to H. Let V{%) be ordered by 
restricting the order < on U to the set V(H) = underlying the group H. Note 
that < is not a left-invariant order on H (indeed, no non-trivial finite group can 
be left-invariantly ordered). Nevertheless, we will argue that, as n — > 00, almost 
all it G V(%) have r-neighbourhoods of type r*. 

The neighbours of a vertex tx G V(14) are elements us where s£SU S^ 1 C 
[—1, l] d . The right multiplication action of s G S U S*" 1 on u can be described 
in two steps as follows: First, the coordinates of s are permuted (as determined 
by ix ) to obtain a vector s'. Then, us is given as the standard addition of the 
vectors u and s' in Z d C R d . Hence, us G u + [—1, l] d , and moreover, 

B u (u,r) Cu+[-r,r] d . (2) 

This means that vertices close to ix in the graph U are also close in the associated 
geometric IR d -embedding. 
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Consider the set of inner nodes I = [r, (n — 1) — r] d . Let u £ I. By (2), 
the vertex set Bu(u,r) is contained in Z^. This implies that the cover map 
ifr is the identity on Bu(u,r) and consequently the r-neighbourhood t(H, <,u) 
contains the ordered tree t(U, <, u) ~ r*. If r (H,<, u) had any additional edges 
to those of t(U, <,u), this would entail a cycle of length < 2r + 1 in T-L, which 
is not possible. Thus, t(T~L, <,u) ~ r*. The density of elements in % having 
r-neighbourhood type r* is therefore at least \I\/\H\ = (n- 2r) d /n d > 1 - e, for 
large n. 

Finally, to establish Theorem 3 it remains to address Ws connectedness. 
But if TL is not connected, an averaging argument shows that some connected 
component must have the desired density of at least (1 — e) of type r* vertices. 
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A Groups of Section 5 in More Detail 

This appendix contains expository material on the structure and properties of 
groups Hi, Wi, and Ui for the convenience of the reader who is not familiar 
with semi-direct products. The notation we introduce here will become useful in 
Appendix B. 

A.l Binary Trees 

We can interpret the elements of groups Hi, Wi, and Ui as complete binary trees 
of height i: there are i levels of internal nodes, one level of leaf nodes, and all 
internal nodes have two children. The number of internal nodes is d(i) = 2* — 1. 

Moreover, there is some data associated with the internal nodes: in Hi the 
internal nodes are labelled with the elements of Z n , in Wj they are elements of 
Z2, and in Ui they are elements of Z. 

A.2 Group Ui 

We will now focus on the case of group Uf, the other two cases are similar. 
Wherever reasonable, we will use the convention that x,y, . . . G Z, A,B, . . . G Ui, 
and a, (3, . . . G Uj for some j < i. 

Base Case. We will use the symbol o to denote a binary tree of height 0. 
Group Uq is the trivial group Uq = {o}. 

Recursive Step. Now assume that we have defined group E/j-i; we proceed 
to define group Ui. Elements of group Ui are triples (a,(3,x), where a,(3e Ui-\ 
and x G Z. Intuitively, (a, (3, x) is a tree with the element x as the root node, a 
as the left subtree, and (3 as the right subtree. 

In what follows, we will write x for an even 'x G Z and x for an odd x G Z. 
Group Z acts on Ui-\ x as follows, depending on the parity: 

x • [a, 0) = (a, f3), 
x • (a, (3) = (f3, a). 

Hence by the definition of the semi-direct product, the group operation in U is 

(a,/3,x)(7,5,y) = (aj, /35, x + y), 

(a,/3,x)('j,S,y) = (aS, x + y), 

(a,P,x)(i,8,y) = (0:7, /36, x + y), 

{a,f3,x){<y,8,y) = (aS, fij, x + y). 

Shorthand Notation. To make this a bit easier to approach, let us define 
some shorthand notation: 

h = °, 

Ii = (!;_!,/;_!,()), 

Xi(x) = (7j_i,/j_i,x), 
[a, /3] = (a, (3,0). 
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In particular, I{ G Ui is an empty tree of height i: all internal nodes are labelled 
with zeroes. Observe that U is the identity element of Uf. 

(a,(3,x)Ii = {a,P,x) = Ii(a,P,x). 

Moreover, we can express any element as a product of [■, •] and Xf. 

[a,P]Xi(x) = (a,P,x). 

We will omit the subscript i when it is clear from the context. 

Examples of Group Operations. Now the group operations are much more 
straightforward: 

[<*,P] [7><S] = [<*!/, P$\, 
X(x)X(y)=X(x + y), 
X(x) [q,/3] = [a,P]X(x), 
X(x) [a,P] = \p,a]X(x), 

X(x)~ 1 =X(-x). 
Here are some further examples: 

{ a ,P,x)- 1 = {[a,P]X{x)y 1 

= X(x)- 1 [a,(3]- 1 

= X(-x) [a'\p-\ 
(a,P,x)- 1 = {cT 1 ,p- 1 ,-x), 
(a,P,x)- 1 = {p- 1 ,or 1 ,-x). 

If A G Ui, we can interpret the left multiplication as follows from the perspective 
of trees: 

- X(x) A: increment the label of the root node by x. 

- X{x) A: exchange the left and the right subtree of A, and increment the 
label of the root node by x. 

[a, f3] A: recursively apply a to the left subtree of A and (3 to the right 
subtree of A. 

A. 3 Positive Cone 

Let us now have a closer look at the definition of a positive cone P in (1). The 
properties of P are easy to verify if we consider the following alternative, recursive 
definition. For each i > 0, we say that A = (a, f3, x) G U is positive if one of the 
following holds: 

(a) x > 0, 

(b) x = and (3 is positive, 

(c) x = and (3 = and a is positive. 

Finally, element Iq G Uq is not positive. Let Pj C Ui consist of all positive 
elements of C7j. Let us prove the following properties. 
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Lemma 8. For any i and A G Ui, precisely one of the following is true: A = Ii, 
A G Pi, or A' 1 G p. 

Proof. The proof is by induction. The case i = is trivial: Uq = {Io}, and 
Iq 1 = Io is not positive by definition. 

Now assume that i > and A = (a, /3,x). If x ^ 0, we have A^ 1 = (7, 5, —x) 
for some 7, S € Ui-\. Therefore we have either (i) A 7^ Ii, A G P, and v4 _1 ^ P, 
or (ii) A I u A (£ P, and A' 1 G P. 

Otherwise x = 0, and we have A -1 = (a -1 , 0). The claim follows by a 
simple case analysis. □ 

Lemma 9. If A,B G p, we ftaue AB G p. 

Proof. The proof is by induction. The claim is trivial if i = 0. Otherwise, let ^4 = 
(cti, fti,x) and P = (02, fii,y)- If x > or y > 0, we have ^4P = (a, )3,x + y) G P«, 
as x + y > 0. Otherwise x = y = and ^4P = (0:10:2, /?i/?2, 0), and the claim 
follows by a simple case analysis. □ 

Hence if we define a relation < on Ui by A < B <s=^- A~ 1 B € Pi, we have 
the following properties: 

(a) A < B and B < C implies A < B < C, 

(b) A < B implies CA < CB, and 

(c) for all A,B £ Ui precisely one of the following is true: A = B, A < B, or 
A > B. 

That is, < is a left-invariant linear order. 
A.4 Group Wi 

Let us now compare Wi with U. The case of Wi is further simplified, as we 
have only two possible values of x G {0, 1}. Moreover, X(0) = I is the identity 
element; hence there is only one non-trivial case, X(l): 



Intuitively, an element A G Wi is simply an automorphism of a complete binary 
tree of height i: for example, X(l) is an automorphism that exchanges the left 
and the right subtree of the root node. The group operation is, in essence, the 
composition of automorphisms. 

More Shorthand Notation. The following notation will prove useful in Ap- 
pendix B. First, let us extend the bracket notation to cover an arbitrary vector 
of length I = 2 k : 



Then, we extend the function X in an analogous fashion. For any 1 = 2, we 
define 



X(l) [a,p] 
X(l)- 1 
(a.0,0)- 1 
(a, P, I)" 1 



[(3,a]X(l), 



(a-\/r\o), 
(/r\a-\i). 



[01,02,03,04] 

[a>i,a 2 , • • • ,a 8 ] 



[[ai,a 2 ], [03,04]], 
[[oi,a 2 ], [03,04]], [[a 5 ,a 6 ], [a 7 ,a 8 ]] 



X i (x 1 ,x 2 , ...,x e ) = [Xi_ k (xi), Xi_ k (x 2 ), Xi_ k (x e )]. 
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and 



Xi(r; i) = Xi(yi,y 2 , ■ ye), where y r = 1 and yj = for all j / r. 
For example, 



Intuitively, Xi(xi,x 2 , ■ ■ ■ , xg) is a complete binary tree of height i that is mostly 
empty: the nodes at depth k have values xi,x 2 , ■ ■ ■ , X£, but all other nodes have 
value 0. 

Examples of Group Operations. We conclude this section with the following 
examples of group operations in Wf. 



X(x!,x 2 , ■ ■ .,xe)X(y 1 ,y 2 , . . . ,ye) = X(x 1 +y u x 2 + y 2 , x e + y e ), 
X(xi,X2, ■ ■ ■ ,Xi) [ai,a 2 , . . . ,ae] = [X(xi)ai, X(x 2 )a 2 , X(xe)ae]. 

In particular, X(r;£) exchanges the rth pair of subtrees: 

X(3; 4) [ai, fa, a 2 , /3 2 , a 3 , fa, a 4 , Pa] = [ai, Pi, a 2 , P 2 , P3, «3, «4, /34]^(3; 4). 

B Constructing High-Girth Generators 

In this appendix, we give an alternative proof of Theorem 7. Our proof borrows 
many ideas from the original proof of Gamburd et al. [10]. However, while they 
use the probabilistic method to prove that a set S exists, we give a simple explicit 
construction. In terms of the asymptotic growth of girth, this version is weaker 
but sufficient for our purposes. We will use the notation defined in Appendix A; 
see Figure 9 for an illustration. 

B.l Preliminaries 

Fix integers g and m. For i = 0, 1, . . . , g, define 



X(3;4) = X(0,0,1,0)= [[X(0),X(0)}, [X (1) , X (0)]] . 



X(xi,x 2 , ...,xg) 1 = X(xi,x 2 , ■ ■ -,xe), 



m 

h{i) 
n(i) 



m + g — i, 

i(2m + 2g -i + l)/2, 
2 /(0. 



Note that we have 




°i = w h(ti)+i = w m-i)+i x w h{i _ 1)+1 x ... x w h{i _ 1)+1 . 



As h(i — 1) + 1 + log 2 n{i) = h(i), we can interpret G\ as a subgroup of W^y. it 
consists of elements of the form 



[ai,a 2 , a n(i )] G W h ^, ai G W h(i _i) + i. 



For each i = 1, 2, . . . , g and k < n(i) we define the projections : Gi — > W^i_i^ + i 



by 



Pi([ai,a 2 , . . . ,a n( i)]) = a k . 
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B.2 Complex Words 



Let Si C W/j(j) . A word in Si is a product of the form 

such that Aj 6 Si and <t(j) £ { — 1,1} for each j. We say that ^ is the length of 
the word. The word w vanishes if we have w = I^u) in group W^uy 

The word is reduced if = Aj + i implies <j(i) = <r(i + 1). That is, in a 
reduced word we do not have a subword of the form BB^ 1 or B~ 1 B, B G Si. 
In the Cayley graph C(H / 7 l (j), Si), a walk of length ^ corresponds to a word of 
length £, a non-backtracking walk corresponds to a reduced word, and a cycle 
corresponds to a reduced walk that vanishes. 

Let X = {Ai, A2, . . . , A#}. We say that the complexity of the word w is 
I — \X\. For example, if L\, L2, and L3 are distinct elements of Si, then 

(a) the complexity of L1L2-L3 and L^L^, is 0, 

(b) the complexity of L3L3, and LiL^L^ 1 is 1, 

(c) the complexity of L1L1L2L2 and L\L^L^ 1 L^ 1 is 2. 

Obviously, the complexity of any word of length g is at most g — 1. 

We say that a word uj is k-complex if it is reduced and its complexity is at 
most k. For example, a 2-complex word is always a 3-complex word as well. 

B.3 Overview 

We will construct sets S± , S2 , ■ ■ ■ , S g such that 

(a) Si C d, 

(b) \Si\ = n(i), 

(c) (i — l)-complex words in Si do not vanish. 
It follows that 

(a) S g CG g , 

(b) |3,| = 2™ 

(c) reduced words of length 1 < I < g in S g do not vanish. 

It follows that the girth of CiW^gy S g ) is larger than g. In particular, Theorem 7 
follows by choosing g = 2r + 1 and m > log 2 k. Our construction satisfies 

g + m > \/%) > ^/log 2 log 2 

B.4 Base Case 

Construction. The base case of our construction, Si C G± is straightforward. 
Note that we have h(0) + 1 = 1; hence G x = IF" (1) . Recall that W x = {I,X(1)}. 
We define 

Lj = X hW (j;n(l)), 

Si = {Li,L 2 , . . . ,i„(i)}. 

Observe that we have the projections 

p k l {L 3 )=p k l (L- j 1 )=I, k*j, 
p k (L J )=p k (Lj 1 )=X(l), k = j. 
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Correctness. Now assume that w is a 0-complex word in Si. Then there is at 
least one £ Si that occurs in w; moreover, occurs only once. It follows 
that Pi(w) = X(l) ^ I, and hence w does not vanish in Gi. 

B.5 Recursive Step 

Construction. Let i > 2. Assume that we have already constructed a set Si-i 
such that (i — 2)-complex words in Si-i do not vanish in VF^i-i). We proceed to 
construct Si C W^uy 

We have \Si-i\ = 2n(i); hence we can label the elements of Si-i by jj and Sj 
where j = 1, 2, . . . , n(i). We define 

K 3 = [7j, 5 i> 7j, • • • , 7j, 

Si = {Li,L 2 , . . . , L n (i)}. 
Observe that we have the projections 





= hj,Sj], 




pHLj) 


= hj,8j], 




Pi(Lj) 


= h j ,s j ]x(i), 


k = j 


pHlj 1 ) 


= hj\sj\ 


k + j 


pf(Lj l ) 


= x(i) hj\6-\ 


k = j 



Intuition. We will prove that a (i — l)-complex word w in Si does not vanish 
in W/j/j). We will argue that there is at least one projection pf(w) 7^ /. To gain 
some intuition, let us begin with some examples. 
First, let w = L1L2L3L1. Now 

pj(w) = [71, 61} X(l) [ 72 , 5 2 ] [73, S 3 ] [ji,Si] X(l) 

= hl6 2 6 3 6i, ^l72737l] , 

Pi( w ) = [71, <5i] [72, S 2 ] X(l) [73, S 3 ] [71, Si] 

= [7172^3^1, Si5 2 ^i]X(l), 
Pi( w ) = [7i> *i] [72, S 2 ] [73, S 3 ] X(l) [71, Si] 

= [717273^1, SiS 2 5 3 -fi] X(l), 
Pi( w ) = [71, *i] [72, S 2 ] [73, S 3 ] [71 A] 

= [71727371 , 61626361]. 

In particular, w is a word of length 4 in Si, and it follows that all projections are 
of form [wi, w 2 ] X(x), where wi and w 2 are words of length 4 in Si-i. Moreover, 
as the original word w was 1-complex, wi and w 2 are also 1-complex: each Lj 
contributes precisely one Sj or 7^ in wi and w 2 . Projection pf(w) is not useful for 
our purposes; nothing interesting happens there, as w did not contain any copies 
of L4. However, the case of p\{w) is more interesting: we have pj(w) = [wi,w 2 ], 
and this time wi and w 2 are not only 1-complex but also 0-complex. 
Second, let w = ^LiL^ 1 . Now 

p}H = h2,S 2 ] [ 7 i,<5 1 ]X(l)[ 72 - 1 , ( 5 2 - 1 ] 

= [7271 6 2 x , 6261J2 1 ]. 

Again we were able to identify a projection p\{w) = [wi,w 2 ] such that wi and 
w 2 are 0-complex even though w is 1-complex. 
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Correctness. To generalise the above observations, we study the following 
cases that cover all possible (i — l)-complex words in Si. 

(a) For some k, word 10 is of the form 

w = sL k tL k u, 

where t does not contain any copies of L k or L^ 1 . It follows that we have 

P f( s ) = x(s') [ Sl , S2 ], P Ht) = [h,t 2 ], pHu) = [ Ul , U2 ]x( u ') 

for some s±, s 2 , ti, t 2 , u\, u 2 E W^(i-i) and s',u' £ {0,1}. Hence 

tf(w) = X(s') [ 8l , s 2 ] [ 7fc) 8 k ] X(l) [h,t 2 ] [ 7fc , 8 k ] X(l) [ Ul ,u 2 ] X(u') 
= X(s') [si 7 fc<2<5feUi, s 2 <Mi 7fc u 2 ] X(u'). 

Now if is an (i — l)-complex word in Si, then s\^ k t 2 5 k u\ is (i — 2)-complex 
in S'i-i: we have removed at least one duplicate element, as one slot of 
L k is replaced with 7 j. and while the other slot is replaced with 8 k . Hence 
Pi(w) 7^ / and w does not vanish. 

(b) For some k, word w is of the form w = sL k ~ 1 tL k ~ 1 u, where t does not contain 
any copies of L k or L^ 1 . It follows that w~ l = u^ 1 L^t^ 1 L k vT x , and the 
above argument shows that w^ 1 / I. Hence w does not vanish. 

(c) For some j / k and some a,b,c G { — 1, 1}, word w is of the form 

w = sL a k tL b jU L c k v, 
where t and u do not contain any copies of Lj, and LJ 1 . It follows that 

p?'(a) = X(a')[si,S2], Pl(t) = [h,t 2 ], 

p{(u) = [ui,u 2 ], Pi(v) = [vi,v 2 ]X(v') 

for some s±, s 2 , ti, t 2 , u\, u 2 , v\, v 2 G W^u^u and s',v' G {0,1}. Hence 

p>(w) = X(s') [ Sl ,s 2 ] bt,5i] [h,t 2 ] [ 7 },^]X(1) [ Ul ,u 2 ] [j c k ,8 c k ] [ Vl ,v 2 }X(v') 
= X(s') [ ai 7 £ t! 7) U2 «5£ V2, S2 8% t 2 8) Ul j c k «i] + 1). 

Again, if io is an (i — l)-complex word in Si, then si 7 £ ii 7 j u 2 5 k v 2 is 
(i — 2)-complex in <Sj_i: on slot of L k contributes a 7 & while the other slot 
contributes a oV Hence w does not vanish. 

(d) The above cases cover all reduced words w that contain at least two 
occurrences of any element of Sf, recall that in a reduced word, we cannot 
have L k and L^ 1 next to each other. The only remaining case is that all 
elements of w are distinct. Then w is 0-complex, and p J i {w) = [101,102] X(x), 
where w\ and w 2 are 0-complex. In particular, they are (i — 2)-complex, 
they do not vanish, and hence w does not vanish, either. 

This concludes the proof of Theorem 7. □ 
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Figure 9: Construction of Appendix B, in the case g = 3 and m = 1. We have 
h(ff) = 0, /i(l) = 4, /i(2) = 7, and h(3) = 9. There are n(l) = 8 elements in 
Si ; these can be interpreted as complete binary trees of height 4, as they are 
elements of group W 4 . In the illustration, a black internal node indicates the 
value 1 and a white internal node indicates the value 0. For the purposes of 
constructing S2 — {Li, L 2 , £3, L4} in Appendix B.5, we label the elements of 
Si by 71, <5"i, 72, 82, ■ ■ ■ , 74, S4. Now Lj is constructed by gluing together copies 
of 7j and Sj. 
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